The aim of this script is to analyse the usage of EAGLE vocabularies in the Epigraphic Database Heidelberg as part of the FAIR Epigraphy project and the Epigraphy.info working group tasked with cleaning up the EAGLE vocabularies.
library(tidyverse)
library(jsonlite)
If you have a locally saved dataset
EDH <- fromJSON("./data/EDH_text_cleaned_2021-01-21.json") ## 2021 version
head(EDH)
You can download the latest dataset directly from Zenodo (uncomment and run the appropriate version). !!!The structure of the data might have changed in between versions as the EDH changed its structure in 2021. This script has been written with the data from 2021 and the code might need minor edits with the data from 2022!!!
### 2021 version
# EDH <- jsonlite::fromJSON("https://zenodo.org/record/4888168/files/EDH_text_cleaned_2021-01-21.json?download=1")
### 2022 version
# EDH <- jsonlite::fromJSON("https://zenodo.org/record/7303886/files/EDH_text_cleaned_2022-11-03.json?download=1")
If you are interested in how this dataset was compiled from the EDH API and XML data dumps, see the Github repository: https://github.com/sdam-au/EDH_ETL.
EAGLE_vocabs<- read.csv("data/queryResults.csv", sep = ",")
head(EAGLE_vocabs)
EAGLE_typeins<- read.csv("data/EAGLE_typeins_all.csv", sep = ",")
head(EAGLE_typeins)
The Epigraphic Database Heidelberg (https://edh-www.adw.uni-heidelberg.de/) uses the
existing vocabularies from EAGLE Project (https://www.eagle-network.eu/resources/vocabularies/) in
the following five Epidoc XML tags: Object Type,
Keywords, Material, RS Execution,
RS Decoration.
Example of record in XML:
<support>
<objectType ref="http://www.eagle-network.eu/voc/objtyp/lod/257">Tafel</objectType>
</support>
In our processed version of the EDH dataset, all records consist of the following attributes:
Eagle Linked Open Data numeric code for vocabulary, i.e. 257, extracted from the reference within the Epidoc XML tag.
Free text description, extracted from the XML tag.
See the raw data below:
EDH_eagle<- EDH %>%
select(layout_execution, layout_execution_text, support_objecttype, support_objecttype_text, support_material, support_material_text, support_decoration, keywords_term, keywords_term_text, type_of_inscription_clean)
EDH_eagle
In 2021, the total of 346 record were missing information extracted from XML files (due to delay in availability of published data through EDH data dumps on their website). However, at least partial information about them could be extracted from the then existing EDH API. For the purpose of the current evaluation of the Eagle vocabularies are these 346 records considered as missing as we cannot access their respective XML files with vocabularies.
EDH %>%
filter(is.na(support_objecttype)) -> missingXML
missingXML
#object
EDH_eagle %>%
filter(!is.na(support_objecttype) & support_objecttype != "") %>% nrow() -> object_total
#type inscription
EDH_eagle %>%
filter(!is.na(keywords_term) & keywords_term != "" ) %>% nrow() -> typeinsc_total
#material
EDH_eagle %>%
filter(!is.na(support_material) & support_material != "") %>% nrow() -> material_total
#writing
EDH_eagle %>%
filter(!is.na(layout_execution) & layout_execution != "") %>% nrow() -> writing_total
#decoration
EDH_eagle %>%
filter(!is.na(support_decoration) & support_decoration != "") %>% nrow() -> decor_total
#how many Eagle vocabularies has been used in the EDH
Eagle_total <- object_total + typeinsc_total + material_total + writing_total + decor_total
Eagle_total
## [1] 400374
Attributes categorising the type of inscription, used 80721 times.
XML: Keywords tag, term tag.
Example of record in XML:
<keywords>
<term ref="http://www.eagle-network.eu/voc/typeins/lod/92">Grabinschrift</term>
</keywords>
EDH_eagle %>%
select(keywords_term, keywords_term_text, type_of_inscription_clean) %>%
head()
EDH_eagle
EDH_eagle$keywords_term <- as.numeric(EDH_eagle$keywords_term)
EAGLE_typeins_all<- EDH_eagle %>% dplyr::left_join(EAGLE_typeins, by=c("keywords_term"="EAGLE_lod_id"), multiple="first")
EAGLE_typeins_all %>%
select(concept, prefLabel, keywords_term_text, type_of_inscription_clean) %>%
count(concept, prefLabel, keywords_term_text, sort=TRUE) -> EDH_EAGLE_usage
EDH_EAGLE_usage
write_csv(EDH_EAGLE_usage, file = "./data/EDH_EAGLE_usage_typeinsc.csv", col_names = TRUE)
length(unique(EDH_eagle$keywords_term))
## [1] 21
https://www.eagle-network.eu/voc/typeins/lod/92.html
EDH_eagle %>%
filter(keywords_term == 92) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/143.html
EDH_eagle %>%
filter(keywords_term == 143) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/80.html
EDH_eagle %>%
filter(keywords_term == 80) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/311.html
EDH_eagle %>%
filter(keywords_term == 311) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/69.html
EDH_eagle %>%
filter(keywords_term == 69) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/261.html
EDH_eagle %>%
filter(keywords_term == 261) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/102.html
EDH_eagle %>%
filter(keywords_term == 102) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/115.html
EDH_eagle %>%
filter(keywords_term == 115) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/73.html
EDH_eagle %>%
filter(keywords_term == 73) %>%
count(keywords_term_text, sort=TRUE)
https://www.eagle-network.eu/voc/typeins/lod/91.html
EDH_eagle %>%
filter(keywords_term == 91) %>%
count(keywords_term_text, sort=TRUE)
EDH_eagle %>%
select(keywords_term, keywords_term_text) %>%
filter(keywords_term == "") %>%
count(keywords_term_text, sort=TRUE)
unique(EDH_eagle$keywords_term_text)
## [1] "Grabinschrift" "Ehreninschrift"
## [3] "Weihinschrift" "Defixio"
## [5] "Besitzer-/Herstellerinschrift" "Besitzer-/Herstellerinschrift?"
## [7] "Meilen-/Leugenstein" "Akklamation"
## [9] "Grenzmarkierung" "Bau-/Stifterinschrift"
## [11] "unbestimmt" "Weihinschrift?"
## [13] "Militärdiplom" "Bau-/Stifterinschrift?"
## [15] "Grabinschrift?" "Ehreninschrift?"
## [17] "Aufschrift" "Rechtliche Verfügung, öffentlich"
## [19] "Rechtliche Verfügung, privat" "Grenzmarkierung?"
## [21] "Beischrift" "Beischrift?"
## [23] "Verzeichnis" "Rechtliche Verfügung, privat?"
## [25] "Kalender" "Aufschrift?"
## [27] "Verzeichnis?" "Sitzinschrift"
## [29] "Elogium" "Assignationsinschrift"
## [31] "Sitzinschrift?" "Elogium?"
## [33] "Gebet" "Akklamation?"
## [35] "Defixio?" "Kalender?"
## [37] "Brief" "Meilen-/Leugenstein?"
## [39] "Adnuntiatio" "Rechtliche Verfügung, öffentlich?"
## [41] "Gebet?" "Brief?"
## [43] "Assignationsinschrift?" NA
## [45] "Militärdiplom?"
Attributes categorising the inscribed object, used 81130 times.
XML: Support tag, tag objectType.
Example of record in XML:
<support>
<objectType ref="http://www.eagle-network.eu/voc/objtyp/lod/257">Tafel</objectType>
</support>
EDH_eagle %>%
select(support_objecttype, support_objecttype_text) %>%
head()
length(unique(EDH_eagle$support_objecttype))
## [1] 40
http://www.eagle-network.eu/voc/objtyp/lod/2.html
EDH_eagle %>%
filter(support_objecttype == 2) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/257.html
EDH_eagle %>%
filter(support_objecttype == 257) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/29.html
EDH_eagle %>%
filter(support_objecttype == 29) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/250.html
EDH_eagle %>%
filter(support_objecttype == 250) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/140.html
EDH_eagle %>%
filter(support_objecttype == 140) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/57.html
EDH_eagle %>%
filter(support_objecttype == 57) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/189.html
EDH_eagle %>%
filter(support_objecttype == 189) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/35.html
EDH_eagle %>%
filter(support_objecttype == 35) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/214.html
EDH_eagle %>%
filter(support_objecttype == 214) %>%
count(support_objecttype_text, sort=TRUE)
http://www.eagle-network.eu/voc/objtyp/lod/89.html
EDH_eagle %>%
filter(support_objecttype == 89) %>%
count(support_objecttype_text, sort=TRUE)
EDH_eagle %>%
select(support_objecttype, support_objecttype_text) %>%
filter(support_objecttype == "") %>%
count(support_objecttype_text, sort=TRUE)
EDH_eagle %>%
select(support_objecttype, support_objecttype_text) %>%
count(support_objecttype, support_objecttype_text, sort=T)
unique(EDH_eagle$support_objecttype_text)
## [1] "Tafel" "Statuenbasis"
## [3] "Altar" "Stele"
## [5] "Tessera" "Urne"
## [7] "Block" "Barren"
## [9] "Meilen-/Leugenstein" "Stele?"
## [11] "unbestimmt" "Herme"
## [13] "Instrumentum domesticum" "Ziegel"
## [15] "Altar?" "Schmuck"
## [17] "Cippus" "Instrumentum militare"
## [19] "Tafel?" "Pflaster?"
## [21] "Sarkophag" "Basis"
## [23] "Architekturteil" "Statuenbasis?"
## [25] "Cippus?" "Architekturteil?"
## [27] "Cupa" "Statue"
## [29] "Platte" "Instrumentum sacrum"
## [31] "Grabbau" "Skulptur"
## [33] "Fels" "Block?"
## [35] "Relief" "Diptychon"
## [37] "Büste" "Instrumentum domesticum?"
## [39] "Ehren-/Grab-/Votivsäule" "Mensa"
## [41] "Bank" "Sarkophag?"
## [43] "Brunnen" "Ehren-/Votivbogen?"
## [45] "Waffe" "Urne?"
## [47] "Clipeus" "Ehren-/Votivbogen"
## [49] "Instrumentum sacrum?" "Pflaster"
## [51] "Olla" "Stadtbefestigung"
## [53] "Schmuck?" "Relief?"
## [55] "Basis?" "Meilen-/Leugenstein?"
## [57] "Ehren-/Grab-/Votivsäule?" "Grabbau?"
## [59] "Mensa?" "Instrumentum militare?"
## [61] "Tessera?" "Platte?"
## [63] "Herme?" "Ziegel?"
## [65] "Statue?" "Cupa?"
## [67] "Skulptur?" "Fels?"
## [69] "Diptychon?" "Büste?"
## [71] NA
Attributes categorising the inscribed material, used 77010 times.
XML: Support tag, material tag.
Example of record in XML (with missing Eagle LOD):
<support>
<material>Marmor, geädert / farbig</material>
</support>
EDH_eagle %>%
select(support_material, support_material_text) %>%
head()
length(unique(EDH_eagle$support_material))
## [1] 67
https://www.eagle-network.eu/voc/material/lod/138.html
EDH_eagle %>%
filter(support_material == 138) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/60.html
EDH_eagle %>%
filter(support_material == 60) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/75.html
EDH_eagle %>%
filter(support_material == 75) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/48.html
EDH_eagle %>%
filter(support_material == 48) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/131.html
EDH_eagle %>%
filter(support_material == 131) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/109.html
EDH_eagle %>%
filter(support_material == 109) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/108.html
EDH_eagle %>%
filter(support_material == 108) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/123.html
EDH_eagle %>%
filter(support_material == 123) %>%
count(support_material_text, sort=TRUE)
https://www.eagle-network.eu/voc/material/lod/115.html
EDH_eagle %>%
filter(support_material == 115) %>%
count(support_material_text, sort=TRUE)
Example:
<material>Marmor, geädert / farbig</material>
EDH_eagle %>%
select(support_material, support_material_text) %>%
filter(support_material == "") %>%
count(support_material_text, sort=TRUE)
unique(EDH_eagle$support_material_text)
## [1] "Marmor, geädert / farbig" "Marmor"
## [3] "Kalkstein" "unbestimmt"
## [5] "Travertin" "Blei, Zinn"
## [7] "Blei" "Granit"
## [9] "Bronze" "Ton"
## [11] "Alabaster" "Gold"
## [13] "Gesteine unbestimmt" "Sandstein"
## [15] "Zinn" "Gesteine"
## [17] "Marmor, weiß" "Schiefer"
## [19] "Holz" "Kalkmergel / Mergel"
## [21] "Konglomerat" "Bronze, Emaille"
## [23] "Kalkstein?" "Silber"
## [25] "Nenfro" "Bronze, Silber"
## [27] "Peperin" "Muschelkalk"
## [29] "Speckstein" "Basalt"
## [31] "Glas" "Tuff"
## [33] "Oolith" "Stein"
## [35] "Elfenbein" "Quarzit"
## [37] "Messing" "Marmor?"
## [39] "Bronze, Gold" "Gneis"
## [41] "Brekzie" "Trachyt"
## [43] "Ton, Muschelkalk" "Kalkstein, Stuck"
## [45] "Granit?" "Glas, Gold"
## [47] "Andesit" "Steatit"
## [49] "Holz, Wachs" "Knochen"
## [51] "Marmor geädert / farbig" "Blei?"
## [53] "Stein?" "Leder"
## [55] "Karneol" "Gold, Karneol"
## [57] "Leder, Silber" "Vulkantuff"
## [59] "Metalle" "Konglomerat, Blei"
## [61] "Eisen, Jaspis" "Marmor, farbig"
## [63] "Trachyte" "Gesteine?"
## [65] "Eisen" "Putz"
## [67] "Molasse" "Kalkstein, weiß"
## [69] "Gold, Silber" "Gold, Glas"
## [71] "Ton, Emaille" "Kupfer"
## [73] "Glas, Blei" "Kalktuff"
## [75] "Marmor, schwarz" "Bronze, Holz, Silber"
## [77] "Holt" "Blei, Eisen"
## [79] "magmatische Gesteine" "Silber, Gold"
## [81] "Bronze?" "Kalkstein, Bronze, Gold"
## [83] "Marmor, geädert/farbig" "Eisen, Gold, Silber"
## [85] "Sandstein, rötlich" "Kalksteine"
## [87] "Bronze, Silber?, Zinn?" "Marmor, Gold"
## [89] "Sandsteine" "Onyx"
## [91] "Syenit" "Porphyr"
## [93] "Sandstein, gelblich" "Dolomit"
## [95] "Dolomit?" "Sardonyx"
## [97] "Gold, Onyx" "Jaspis"
## [99] "Bronze, Onyx" "Achat"
## [101] "Heliotrop" "Bernstein, Gold"
## [103] "Lapislazuli" "Hämatit"
## [105] "Bernstein" "Sandstein, rot"
## [107] "Bronze, Gold, Kalkstein" "Sandstein?"
## [109] "metamorphe Gesteine" "Basalt, Sandstein"
## [111] "Elfenbein?" "Metall"
## [113] "Ton, Putz" "Gesteine; Blei"
## [115] "Broze" "Bronze, Blei"
## [117] "Bronze, Metalle, unbestimmt" "Kalkstein, Sandstein"
## [119] "Bronze, Zinn?, Silber?" "Stuck"
## [121] "Aplit" "Bronze, Eisen"
## [123] "Eisen, Kupfer" "Gagat"
## [125] "Opal" "Magnetit"
## [127] "Kohlenkalk" "Bronze, Gold, Granat"
## [129] "Travertin?" "Ton, Stuck"
## [131] "Lehm" "Bronze, Eisen, Silber"
## [133] "Bronze, Silber, Gold" "Blei, Bronze, Gold, Kalkstein"
## [135] "Serpentin" "Tuff, Stuck"
## [137] "Kupfer, Gold" "Bronze, Gold, Sandstein"
## [139] "Magmatische Gesteine" "Sandstein, Metall"
## [141] "Kalkstein, Bronze, Blei" "Kalkstein, Blei"
## [143] "Marmor, Granit" "Kalkstein, Marmor"
## [145] "Kupfer, Zinn" "Onyx, Gold"
## [147] "Bronze, Zinn" "Gesteine, Ton"
## [149] "Chrysopras" "Hematite"
## [151] "Zinn, Blei" "Eisen, Messing"
## [153] "Metall, Zinn" "Gold, Gesteine"
## [155] "Marmor (Farbe unbestimmt)" NA
## [157] "Elfenbein, Kupfer, Silber" "Spongolith"
## [159] "Bronze, Holz" "Eisen, Silber, Gold, Minerale"
## [161] "Feldspat" "Basalt, Bronze"
## [163] "Ton?" "Jaspis, Gold"
## [165] "Karneol, Silber" "Andesit?"
## [167] "Blei, Silber" "Minerale"
## [169] "Bronze, Gold." "Holz, Erle"
## [171] "Sandstein, Metalle" "Kalkstein, Bronze, Zinn"
## [173] "Kohlenkalk, schwarz" "Sandstein, weiß"
## [175] "Ryolith" "Sandstein, bräunlich"
## [177] "SIlber" "Marmor, weiß-blau"
## [179] "Eisen, Silber" "Eisen, Silber, Messing"
## [181] "Gesteine, Blei, Eisen" "Eisen, Gold?, Bronze?"
## [183] "Kreide" "Silber?"
## [185] "ton" "Holz, Bronze"
## [187] "Speckstein?" "Eisen, Bronze"
## [189] "Mörtel" "Kalsktein"
## [191] "Kalksstein" "Getseine"
## [193] "Lava" "Basaltlava"
## [195] "Klkstein" "Gestein"
## [197] "Silber, gold" "Kalstein"
Attributes categorising the execution of writing, used 80383 times.
XML: Layout tag, type execution.
Example of record in XML:
<layout>
<rs type="execution" ref="http://www.eagle-network.eu/voc/writing/lod/21">unbestimmt</rs>
</layout>
EDH_eagle %>%
select(layout_execution, layout_execution_text) %>%
head()
length(unique(EDH_eagle$layout_execution))
## [1] 9
https://www.eagle-network.eu/voc/writing/lod/21.html
EDH_eagle %>%
filter(layout_execution == 21) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/88.html
EDH_eagle %>%
filter(layout_execution == 88) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/77.html
EDH_eagle %>%
filter(layout_execution == 77) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/96.html
EDH_eagle %>%
filter(layout_execution == 96) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/145.html
EDH_eagle %>%
filter(layout_execution == 145) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/23.html
EDH_eagle %>%
filter(layout_execution == 23) %>%
count(layout_execution_text, sort=TRUE)
https://www.eagle-network.eu/voc/writing/lod/152.html
EDH_eagle %>%
filter(layout_execution == 152) %>%
count(layout_execution_text, sort=TRUE)
EDH_eagle %>%
filter(layout_execution == "") %>%
count(layout_execution_text, sort=TRUE)
EDH_eagle %>%
select(layout_execution, layout_execution_text) %>%
filter(is.na(layout_execution)) %>%
count(layout_execution_text, sort=TRUE)
unique(EDH_eagle$layout_execution_text)
## [1] "unbestimmt"
## [2] "eingemeißelt (mit Farbresten)"
## [3] "gestempelt"
## [4] "Mosaik"
## [5] "gepunzt"
## [6] "geritzt / Graffito"
## [7] "gepunzt, geritzt / Graffito"
## [8] "appliziert, litterae aureae (von außen eingedübelt)"
## [9] "gemalt / Dipinto"
## [10] "eingraviert"
## [11] "appliziert, litterae aureae (in Buchstabenbettungen)"
## [12] "durchbrochen"
## [13] "gestempelt, geritzt / Graffito"
## [14] "appliziert, keine litterae aureae"
## [15] "geritzt / Graffito, eingraviert"
## [16] "gestempelt, gepunzt"
## [17] "gepunzt, eingraviert"
## [18] "geritzt / Graffito, appliziert, litterae aureae (von außen eingedübelt)"
## [19] "geritzt / Graffito, gemalt / Dipinto"
## [20] "appliziert, litterae aureae (in Buchstabenbettungen), appliziert, litterae aureae (von außen eingedübelt)"
## [21] "gemalt / Dipinto, geritzt / Graffito"
## [22] "gestempelt, gepunzt, geritzt / Graffito"
## [23] "gepunzt, appliziert, litterae aureae (in Buchstabenbettungen)"
## [24] "gestempelt, gemalt / Dipinto"
## [25] "geritzt / Graffito, eingemeißelt (mit Farbresten)"
## [26] NA
## [27] "gestempelt, gepunzt, eingraviert"
## [28] "gestempelt, eingraviert"
## [29] "gestempelt, geritzt / Graffito, gemalt / Dipinto"
Attributes categorising the type of decoration. In EDH very limited use - not the full potential of LOD, used 81130 times.
XML: Support, Rs tag, type decoration.
Example of record in XML:
<rs type="decoration" ref="http://www.eagle-network.eu/voc/decor/lod/1000">nein</rs>
https://www.eagle-network.eu/voc/decor/lod/1000.html = nein
https://www.eagle-network.eu/voc/decor/lod/2000.html, 2000 = ja
NA = missing information
length(unique(EDH_eagle$support_decoration))
## [1] 3
Johannes Gutenberg University in Mainz, petra.hermankova@uni-mainz.de, https://orcid.org/0000-0002-6349-0540↩︎